@LucasWilkinson LucasWilkinson commented Sep 30, 2025

In the final release, DeepseekV32 was registered separately, so the config override was no longer being picked up.

Signed-off-by: Lucas Wilkinson <[email protected]>
@mergify mergify bot added the deepseek Related to DeepSeek models label Sep 30, 2025
@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request addresses a bug where the specific configuration for DeepseekV3.2 models was not being applied. The changes involve renaming the configuration class to DeepseekV32ForCausalLM and updating the model configuration map accordingly. The logic for applying the custom KV cache settings has also been simplified. While the changes are generally good, I've identified a potential issue in the handling of the bfloat16 cache data type that could lead to unexpected behavior.
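The failure mode described above can be sketched as follows. This is a minimal illustration, not the project's actual code: `CacheConfig`, `OVERRIDES`, `apply_overrides`, and the `"model-specific-default"` value are hypothetical names chosen for the example. It only shows how an override keyed by architecture name is silently skipped when the registered name no longer matches the key.

```python
from dataclasses import dataclass


@dataclass
class CacheConfig:
    # Hypothetical stand-in for a KV cache configuration object.
    cache_dtype: str = "auto"


class DeepseekV32ForCausalLM:
    """Hypothetical override hook; the class name doubles as the lookup key."""

    @staticmethod
    def apply(cache: CacheConfig) -> None:
        # Placeholder value: the real default is not stated in this thread.
        if cache.cache_dtype == "auto":
            cache.cache_dtype = "model-specific-default"


# Hypothetical map from registered architecture name to override hook.
OVERRIDES = {"DeepseekV32ForCausalLM": DeepseekV32ForCausalLM}


def apply_overrides(architecture: str, cache: CacheConfig) -> bool:
    """Apply the override for `architecture`; return False if none is registered."""
    hook = OVERRIDES.get(architecture)
    if hook is None:
        # No matching key: the config is left untouched, with no error raised.
        return False
    hook.apply(cache)
    return True
```

With a matching key the default is rewritten; with any other architecture name the lookup misses and the config keeps its original value, which is the kind of silent miss the PR description reports.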

@LucasWilkinson LucasWilkinson changed the title from "[BugFix] Fix default kv-cache-default for DeepseekV3.2" to "[BugFix] Fix default kv-cache-dtype default for DeepseekV3.2" Sep 30, 2025
@LucasWilkinson LucasWilkinson added this to the v0.11.0 Cherry Picks milestone Sep 30, 2025
Signed-off-by: Lucas Wilkinson <[email protected]>
@heheda12345

CC @youkaichao @zyongye

@simon-mo simon-mo merged commit 8d7da92 into main Oct 1, 2025
9 checks passed
@simon-mo simon-mo deleted the lwilkinson/fix-defaults branch October 1, 2025 04:58
simon-mo pushed a commit that referenced this pull request Oct 1, 2025
pdasigi pushed a commit to pdasigi/vllm that referenced this pull request Oct 2, 2025
yewentao256 pushed a commit that referenced this pull request Oct 3, 2025
tomeras91 pushed a commit to tomeras91/vllm that referenced this pull request Oct 6, 2025
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025
choprahetarth pushed a commit to Tandemn-Labs/vllm that referenced this pull request Oct 11, 2025
shyeh25 pushed a commit to shyeh25/vllm that referenced this pull request Oct 14, 2025
lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025